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ABSTRACT 

Using both halo model calculations and a large sample of simulated SZ maps, we demonstrate that 
high-mass clusters add significant non-Gaussian variance to measurements of the SZ power spectrum 
amplitude. The difhculty in correctly accounting theoretically for the contribution of these objects to 
the uncertainty in C'i leads to a reduced sensitivity to erg. We show that a simple solution is to mask 
out the brightest clusters in the map before measuring the power spectrum. We demonstrate that 
fairly conservative masking can reduce the variance and Gaussianize the statistics significantly, thus 
increasing the sensitivity to cosmological parameters. Choosing which objects to mask is non-trivial; 
we found that using a fixed sky density produced a well-defined and well-behaved estimate that can 
easily be applied to real maps. For example, masking the 10 (90) brightest clusters in a 100 deg^ SZ 
map will improve the sensitivity to Ci by a factor of two a,t £ ~ 1000 (2000) and 1.5 at £ — 2000 
(4000). We show that even in the presence of astrophysical foregrounds (primary CMB and point 
sources) and instrument noise, one can increase the precision on measurements of erg by masking up 
to 0.9 clusters deg~^. 

Subject headings: galaxies: clusters: general — cosmic microwave background - intergalactic medium 
— cosmology: theory - methods: N-body simulations 



1. INTRODUCTION 



The Sunyaev-Zel'dovich (SZ) effect has long been rec- 
ognized as a powerful tool for probing the physics of 
the intra-cluster medium, large-scale st ructure forniation 
and the dark energy e quation of state (|Birkinshawlll999l : 
ICarlstroin et al.ll2002f). Simple but robust analytical ar- 



guments (jBarbosa et al.ll996HHolder fc C arlstrom''200l') 
and hydrodynaniical si r nulations of clusters (White et al. 
120021: iMotl et al.l 120051 : lNagaill2006l ) have indicated that 
the integrated SZ flux (the integral of the temperature 
decrement across the surface area of a cluster) should cor- 
relate tightly with cluster mass. Combined with a greater 
sensitivity than optical or X-ray surveys to high red- 
shift objects, this makes SZ-selected cluster samples well 
suited for measuring the evolution of the cluster mass 
function over a wide range in redshift. Experiments such 
as the Atacama Cosmology Telescope (Kosowskvl l2003l) 
and the South Pole Telescope (|Ru hl et al. 2004) are cur- 
rently surveying the microwave sky to develop large cat- 
alogs of galaxy clusters that are uniformly selected by 
the SZ flux. 

However, in order to use cluster samples for this pur- 
pose, it is necessary to have a good unde rstanding of 
both the selection ifunction of the survey (|Melin et al.l 
120061 ISchafer fc BartelmannI [2001 and of the mappmg 
between the measured integrated SZ flux and cluster 
mass (and how this evolves with redshift). Besides the 
slope and normalization of the flux-mass (Y-M) rela- 
tion, one must also gain some measure of the scat- 
ter around the mean relation in order to be able to 
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differentiate between d i fferent dark energ y cosmologies 
(|Battve fc Welled 120031 : iLima fc Hul l2005l ) . A detailed 
characterization of the Y-M relation may require X-ray 
follow-up of a r epresentative sub-s ample of SZ-selected 
clusters ([Maium dar fc MohiJ |2003[ ) . Complete optical 
coverage of surveyed fields is necessary for obtaining clus- 
ter redshifts and to help determine the selection func- 
tion. Thus, although there is much power in this method, 
much work is required before it can be applied to its full 
potential. 

The SZ effect can also be detected as a secondary 
anisotropy in the CMB temperature anisotropy power 
spectrum, appearing as 'excess power' (over the pre- 
dicted primary anisotropy signal) on angular scales below 
several arc-minutes. In principle, measuring the SZ an- 
gular power spectrum is an easier task than serendipitous 
cluster detection as one is searching for temperature fluc- 
tuations (on a given angular scale) to which clusters over 
a wide mass and redshift range contribute signiflcantly, 
without the need to resolve individual clusters at high 
signal-to-noise. 

Predictions for the SZ power spectrum amplitude C^^sz 
(henceforth C j) can be made using the halo model (e.g. 
ICooravll2001h and estimates of the radial pressure pro- 
file of intra-cluster gas. Assuming that the cluster 
gas resides in hydrostatic equilibrium in the potential 
well of the host dark matter halo (with an additional 
steepe ning of the gas density p rofile in the outer re- 
gions), [Komati^^&|silia3 (|2002f ) demonstrated that the 
ensemble averaged power spectrum amplitude Ci_ has 
an extremely sensitive dependence on cg, where Ci oc 
cr|(rit,/i)^. Although there remains a certain ambiguity 
in the exact amplitude and shape of the predicted signal 
due to uncertainties in cluster gas physics (and under- 
standing of the contribution of sub- mm and synchrotron 
sources to the power spectrum provides an added com- 
plication), the SZ angular power spectrum represents a 
robust observable with which competitive constraints on 
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(Tg can be obtained. 

Rather than attempting to resolve individual clusters, 
small-scale CMB experiments have focused on detecting 
SZ power at arc-minute scales and measuring the am- 
pHtude of Ci and thus CTg. BIMA surveyed 0.2 square 
degrees on the sky measuring a fluctuation amplitude 
of = Cd{t + l)/27r = 22Qt\fo^J'K^ , from which 

they inferred a value of ag = l.OStoJg (|Dawson et al.l 
|2006[) and CBI recently ineasur ed an excess of l.Gcr above 
(Tg = 0.8 (jSievers et al.l l200"9l ) . Clearly there is some 
tension between the value of erg inferred from these ex- 
periments and the current value of tig ^ 0.8 favored by 
WMAP and X-ray cluster n umber counts (jPunklev et al.l 
120091: IVikhhnin etaIll2009D. However, we note that re- 
cent results from SZA (ISharp et al] 120091) and QUaD 
(|Friedman fc QUaD Collaboration! I2OO90 measurements 
of the SZ power spectrum infer a value of cg that is more 
consistent with those derived from other methods. 

One possible explanation for this discrepancy is that 
the ICM gas models used to calculate the SZ power spec- 
trum do not include or correctly capture the relevant 
physics that determine the shape and amplitude of pres- 
sure profiles, resulting in a large modeling uncertainty in 
the a mplitude of Ci. Analytic arguments (iHolder et al. 
20001) and hydrodynam ical simulations (|da Silva et al. 
20011: iWhite et all |2002[ ) suggest that the inclusion of 
more detailed physics (radiative cooling, star formation 
and energy feedback) only moderately influence the am- 
plitude of the power spectrum. This is partly because 
these effects mostly influence the temperature and den- 
sity of gas in cluster cores, whereas it is the gas at larger 
radii that contributes most to Ci (iKomatsu fc SeliaE 
I2OO2I ). However, to date there has not yet been a system- 
atic comparison between simulated and analytic power 
spectra, partly due to the limited total area of SZ maps 
that can be produced from the hydrodynamical simu- 
lations, and so the actual magnitude of the theoretical 
uncertainty in Ci remains uncertain. 

A second explanation relates to the fact that the prob- 
ability density function for the field to field distribution 
of , P(Cf ), is inherently non-Gaussian, with a long tail 
towards high values of Ci. The significant non-Gaussian 
contribution is due to Poisson fluctuations in the sam- 
pling of the halo mass function for a given survey field. 
Hence, although the average power measured over 
many maps should reproduce the predicted halo model 
spectrum (assuming the cluster gas physics is correctly 
modeled) , the power measured in any single map may be 
several times greater than the ensemble-averaged value, 
if that field happens to contain several very massive 
clusters. These objects, while not contributing strongly 
to the mean SZ power spectrum drive the significant 
non-Gaussian contribution to V{Ci). IZhang fc ShethI 
(j2007f ) derived an analytic expression for P(C^) using the 
halo model approach and demonstrated that the degree 
of non-Gaussianity is most significant at large angular 
scales, where massive clusters contribute most of their 
signal. Until now, it has not been possible to generate a 
large enough sample of independent simulated SZ maps 
in order to test the analytic expressions. 

The main purpose of this paper is to demonstrate, us- 
ing both halo model calculations and a large sample of 
simulated SZ sky maps generated from a 'lightcone' sim- 



ulation, that the non-Gaussianities in P(C£) are indeed 
driven by shot noise due to the number of very high mass 
clusters in a given field, and that one can reduce this 
Poisson contribution to the total variance in by mask- 
ing these objects from sky maps. We compare the power 
spectra measured from our simulated maps with the halo 
model analytic predictions for Ci and the non-Gaussian 
contribution to P(C^). In Section [2] we describe the halo 
approach to calculating the SZ angular power spectrum 
and trispectrum, and demonstrate how masking clusters 
will increase the sensitivity of the measured to erg . In 
Section [3] we compare the analytic and simulated results 
for C^, P(C£) and ctnGj and demonstrate in practice the 
utility of masking the brightest clusters. In Section [4] 
we discuss the impact of astrophysical foregrounds and 
instrument noise on our results. 

Henceforth, in this paper cluster mass is measured 
within the spherical region Rmr , defined as the region en- 
closing a mean overdensity of Ayjr Pcrit- Ayjr is calculated 
using the fitting formula provided by iBrvan fc Normanl 
(|1998f ). The fiducial assumed cosmology has parame- 
ters Q.m = 0.27, Us = 0.96, Q.A = 0.73, erg = 0.77 and 
h = 0.72. 

2. SUNYAEV-ZEL'DOVICH ANGULAR POWER SPECTRUM 

The thermal SZ effect is a distortion of the CMB 
caused by inverse Compton scattering of CMB photons 
(at temperature Tcmb) off electrons (at T^) in the high 
temperature plasma within galaxy clusters. To first 
order, the temperature change at frequency v of the 
CMB is given by AT^/Tcmb = /^(a;)y, where f^{x) = 
a;(coth(a;/2) — 4), a; = hi>/kBTcMBi and y is the normal 
Compton parameter 

y - (^) f nS)TS)dl , (1) 
\ nieC- J J 

where the integral is along the line of sight. There is a 
host of higher order effects that can be important at the 
10% level, including bulk motions and relativistic effects 
(jNozawa et al.|[2006l ) . Unless stated otherwise we assume 
henceforth that fv{x) = —2, which is appropriate in the 
Ray leigh- Jeans limit. 

We can also calculate the SZ power spectrum, assum- 
ing that clusters are Poisson-distributed, by simply sum- 
ming up the squared Fourier-space SZ profiles, y{M, z, £) 
of all clusters: 

where V(z) is the comoving volume per steradian and 
n{M, z) is the number density of objects of mass M at 
redshift z. For the latter we use the fitting function of 
iJenkins et al.l (|2001h . while for the SZ profiles as a func- 
tion of mass and redshift we use either a simple /3-model 
(where y — ?/o(l + l9cY)~^) with self-sim ilar scaling 
for the central j/o (Holder fc Carlstrom 1999) or the hy- 
drostatic model of iKomatsu fc Seljak (2002) (henceforth 
referred to as the KS model) . We have extensively exper- 
imented with different profiles, with most prescriptions 
showing differences from each other at the level of tens 
of percent in the power for consistent assumptions about 
gas fractions and cosmological parameters. 

The dominant contribution to the non-Gaussian error 
comes from the shot noise in galaxy clusters. This Pois- 
son contribution to the variance at a single value of £ 
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Fig. 1. — Contour plot demonstrating the weight with which 
clusters of mass Mvir at redshift z contribute to the SZ power 
spectrum (black contours) and trispectrum (blue contours) at £ = 
2500. The three lines intersecting the contours are lines of constant 
integrated SZ flux (Y), each separated by a factor of 2. They 
demonstrate that one could mask the signal from clusters brighter 
that some threshold in Y to significantly reduce the trispectrum 
variance without removing a large fraction of the signal from the 
power spectrum. 

comes from the one-halo contribution to the trispectrum 
dCoora v 200 1; Komatsu & Seljak 2002; Zhang fc She th, 
|2007| ). For closely separated bins in I space, the variance 
in the SZ power spectrum is given by 

where the trispectrum due to shot noise is 

Henceforth, we will refer to the second term within the 
brackets in Eqn. [31 as the 'non-Gaussian' variance (or 
trispectrum), ct^q. We will refer to the first term as the 
'Gaussian' variance, CTq. We note that the ratio of these 
two quantities is independent of the observed map area 

fsky 

Figure[l]demonstrates how clusters at a given mass and 
redshift contribute to Ci and Tti a.t £ = 2500 and for erg = 
0.77 (assuming a /3-model for the gas pressure profiles). 
The black and blue contours represent equal contribution 
to Cf and Tu, respectively. Clusters in the redshift range 
0.3 < z < 0.6 and of mass 4 x IQii/j-iAfQ contribute 
most to the power spectrum, whereas the trispectrum 
is driven by clusters at approximately the same redshift 
but with more than twice the mass. Since the SZ flux 
scales as (mass)^^^ , the trispectrum is clearly skewed 
toward much larger masses than the power spectrum. 
This leads to a significant amount of non-Gaussianity in 
the statistics of the SZ power spectrum being driven by 
the most massive objects. 

It is clear from Figure[l]that although the most massive 
clusters do not strongly contribute to the mean power 
spectrum at £ — 2500, they dominate the trispectrum. 
These clusters are a significant source of noise in any 
measurement of Ci. Therefore, if one simply clips the 
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Fig. 2. — (top) The SZ angular power spectrum without masking 
clusters (black), and masking out the brightest 0.01, 0.1 and 0.9 
clusters per square degree (blue, green and red lines). The solid 
and dashed lines show the results for ag = 0.77 and 0.80, respec- 
tively, (middle, top) The difference in £(i + l)C^/27r between the 
two values of erg for each (un)masked case, (middle, bottom) The 
standard deviation of the canonical distribution of C(, as given 
by Eqn. [3] Note that the value of cr plotted here is calculated 
assuming full sky coverage (/sky = 1) and for ""8 = 0.77. The 
dotted lines shows the contribution of the purely Gaussian compo- 
nent (cQ, see Eqn. [S}. (bottom) The sensitivity of the SZ power 
spectrum to a change of 0.03 in erg, Aci/a(Ci), having masked 
out clusters divided by the sensitivity for the unmasked case. This 
panel demonstrates that masking the brightest clusters down to 
a surface density of 0.9 clusters deg~-^ improves the sensitivity of 
the SZ power spectrum to erg by a factor of 3.2 (2.0) at <? = 1000 
(4000). 

most massive objects from the map, the field-to-field vari- 
ance of Ce will be reduced and the statistics will become 
more Gaussian. The three lines intersecting the contours 
in Figure [U are lines of constant integrated flux (Y), each 
separated by a factor of 2. They represent thresholds 
above which one could mask clusters so that the impact 
of the trispectrum (or non-Gaussian variance) is greatly 
suppressed, while not removing a significant amount of 
signal from a power spectrum measurement. This then 
allows for a more precise measurement of the 'true' _or 
ensemble-averaged SZ power spectrum amplitude, C^, 
and thus on erg. 

Figure [2] demonstrates the improvement in sensitivity 
to (Ts when clusters above a given threshold are masked 
out of the analysis. We define our threshold in terms of a 
cumulative surface density on the sky, such that all bright 
clusters down to surface density nth (deg^^) are masked 
out from the map. We adopt a surface density threshold 
rather than an integrated flux threshold so that the num- 
ber of clusters masked is relatively insensitive to cts and 
modeling of cluster gas physics (see discussion below). 
In practice we have chosen three values of nth: 0.01, 0.1 
and 0.9 deg^^. SPT has already demonstrated that clus- 
ters can be detected at > 5a significance do wn to the 
0.1 deg-2 threshold ()Staniszewski et al.l[200i ). and both 
ACT and SPT are predicted to detect clusters down to 
the 0.9 deg~^ threshold. Therefore, it should be possi- 
ble for both these experiments to identify and mask or 
remove clusters down to the thresholds that we examine 
here. 
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The top panel in Figure [2] shows the SZ angular power 
spectrum for no masking (black) and masking down to 
0.01, 0.1 and 0.9 clusters per square degree (blue, green 
and red lines). The solid and dashed lines show the re- 
sults for (jg = 0.77 and 0.80, respectively. As expected, 
masking the brightest clusters primarily removes power 
at larger scales (£ < 3000). Removing the brightest 
0.01 clusters per square degree has little impact above 
i = 1000, however masking down to 0.1 (0.9) deg~^ re- 
duces the power spectrum at this scale by 33% (66%). 
It is clear that the change in power due to masking is 
greater than that due to the 0.03 change in erg, so the 
masking must be included in the theory prediction for 
the signal. 

Although the second panel indicates that an unmasked 
map has the greatest raw sensitivity to trgj the third 
panel demonstrates that the masked maps have much 
lower total variance, and are therefore more precise. The 
reduced variance more than compensates for the slight 
reduction in raw sensitivity, such that the ability to use 
the SZ power spectrum as a cosmological tool is enhanced 
by using maps with the brightest sources masked. This 
does not include the extra information encoded in these 
individually-detected clusters and is based solely on the 
information in the power spectrum. The bottom panel 
shows the improvement in true sensitivity (A(C£) / a{Ci)) 
having cut out bright clusters, relative to the uncut case. 
At I = 1000, masking 0.1 clusters/deg^ results in a fac- 
tor of 2.6 improvement in the cosmological sensitivity of 
the SZ power spectrum. At I — 3000 there is still a 50% 
improvement. 

We note that given full knowledge of the number den- 
sity of clusters and the thermal properties of the ICM 
over a wide range in mass and redshift, one could con- 
struct an optimized scheme for measuring the power 
spectrum amplitude by de- weighting the signal from high 
mass clusters. However, as one would already know the 
cluster mass function - which is more sensitivity to cos- 
mological parameters - the power spectrum would then 
contain no further cosmological information. In the ab- 
sence of such an optimal estimator, a simple solution is 
to apply zero weight (i.e. mask) the highest mass clus- 
ters from power spectrum measurements (to which they 
mostly contribute just noise) while utilizing them to sam- 
ple the cluster mass function, to which they contribute 
significant statistical information. 

We now investigate the difference between applying 
cuts defined by a surface density or integrated SZ flux 
threshold. In Figure [3] we plot the fractional error in 
the amplitude of the SZ power spectrum as a function of 
angular multipolc number (assuming full sky coverage). 
The black solid and dotted lines show the unmasked re- 
sults for (78 = 0.77 and 0.87 respectively. The solid blue, 
green and red curves show the results having applied our 
normal surface density cuts (nth = 0.01, 0.1 and 0.9 clus- 
ters deg~^) for CT8 — 0.77. The dotted lines show the 
fractional error for erg — 0.87 having applied exactly the 
same cut in flux. The dashed lines show the results for 
cTg = 0.87 having now used the surface density rather 
than flux thresholds. 

The plot demonstrates that using cuts in surface den- 
sity produces a fractional error that is approximately in- 
dependent of CTg- On the other hand, adopting a flux 
threshold means that the number of clusters masked - 
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Fig. 3. — The fractional error in the ampHtude of the SZ power 
spectrum as a function of angular multipole number (assuming full 
sky coverage). Solid lines show the results for erg = 0.77, having 
applied no masking (black), and masking the brightest clusters 
down to a surface density nth = 0.01 (blue), 0.1 (green) and 0.9 
(red) clusters deg~^. Dotted lines show the fractional error for 
(Tg = 0.87 having applied the same cut in flux, Y^_^{n = nth)- 
Dashed lines show the results for erg = 0.87 having applied surface 
density cuts rather than flux cuts. 

and thus cr(Q)/C^ - is highly sensitive to ag. Larger 
values of erg result in greater numbers of clusters with 
flux greater than a given value. This will ultimately make 
the statistical error on measurements of Ci more difhcult 
to determine. Furthermore, the effectiveness of applying 
flux cuts is also dependent on the accuracy with which 
one is able to measure cluster fluxes observationally, and 
the mass and redshift dependence of the intrinsic scatter 
in the M-Y relation. Using surface density thresholds 
neatly circumvents these issues. 

3. COMPARISONS WITH SIMULATIONS 

3.1. Simulations 

To date, it has proven difficult to compare the statis- 
tics of the SZ power spectrum as predicted by analytic 
calculations and numerical simulations as the typical box 
size of hydrodynamical simulations limits the number of 
independent maps that can be created, while also poorly 
sampling the extreme high-mass end of the halo mass 
function. Here we combine an N-body 'lightcone' simu- 
lation with a semi-analytic model for cluster gas to gen- 
erate a large number of maps and thus enable such a 
comparison. We also use our large sample of simulated 
maps to demonstrate the utility of masking clusters to 
improve the precision of measurements of the SZ power 
spectrum. 

To generate the simulated SZ maps, we begin with the 
output of a large {N = 1260"^ = 2 x 10^ particles) cos- 
mological dark matter simulation. Cosmological param- 
eters were chosen to be consistent with those measured 
from the 3rd- year WMA P data combined wi th large-scale 
structure observations (jSpergel et al.l [20071) . The simu- 
lated volume is a periodic cube of size L = 1500ft.~^Mpc; 
the particle mass rup — 1.22 x 10^^/i~^Mq and the cu- 
bic spline softening length e = 16.5ft.~^kpc. This sim- 
ulation was carried out with exactly the same methods 
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used for the simula tion described in iBode et all (|2007[ ) , 
iSehgal et all (|2007f ) and lShaw et~all (|2008D, so for a de- 
scription of the computational details we refer the reader 
to these papers. 

The very large box size of this simulation makes it 
ideal for generating a large area of simulated sky. The 
matter distribution in a light cone extending to z = 3 
was saved in 348 time slices. Dark matter halos in 
this light cone were identified with the Friends-of- Friends 
algorithm using a comoving linking length parameter 
5 = 0.2. In total, the lightcone covers a single octant 
on the sky (« 5000 deg^), containing over two niil- 
hon clusters with MpoF > 10^^/i~^Mq. For the pur- 
poses of the analysis here we retain only clusters with 
Mfof > 3 X 10"/i-1Mq. 

The cluster gas distribution in each halo was calculated 
using the se mi-analytic model d escribed in lOstriker et al.l 
(|2005f) and iBode et all (|2007D . In brief, a 3D mesh 
(with cell side-length 2e = 33h~^ kpc) is placed around 
each cluster, with the gas pressure and density deter- 
mined in each mesh cell assuming hydrostatic equilib- 
rium and a po lytropic equation of s tate (with adiabatic 
index F = 1.2. lAscasibar et"aLll2006[) . It is also assumed 
that the gas in the densest cluster regions has cooled and 
condensed, formin g stars. At z = 0, the stellar/gas mass 
ratio is set to 0.1 (jLin et al.ll2003l : IVoevodkin &: VikhlininI 
I2004D . To compute the star/gas ratio at z > 0, the star 
formation rate was as sumed to follow a delay ed exponen- 
tial model (Eqn. 1 of lNagamine et al]|2006f ). with decay 

time T = 1.5 Gyr. 

As discussed in detail in IBode et al.l ()2007[ ). the most 
important free parameter in this model is the energy in- 
put into the cluster gas via non-thermal feedback pro- 
cesses, such as AGN outflows and supernovae. This 
is set through the parameter e/, such that the feed- 
back energy is e/M*c^, where is the stellar mass 
in the cluster. It was demonstrated that setting £/ = 
3 X 10~^ provides good agreement between the M — T 
and fg — T scaling relations obtained from our simula- 
tion plus gas model cluster sample and those from X-ray 
observations (lArnaud et all 120051: iVikhlinin et al.l 120061: 
Gastaldello et al.ll2007l: lO'Hara et aLll2006l: [Zhang et all 
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From the output of our ICM model, we produce a 2-d 
'postage-stamp' y image for each cluster by summing up 
the electron pressure n^Te in each mesh cell along one d i- 
rection (for example, see Fig. 7 of lOstriker et al.|[2005D . 
We then construct a library of images for all clusters in 
the lightcone octant with M^ir > 2.5 x lO^^M©. Note 
that each postage-stamp is produced having rotated the 
3D mesh around the cluster so that one face lies par- 
allel to the plane of the lightcone 'sky'. SZ cluster sky 
maps are thus produced by projecting down the light- 
cone, summing up the contribution of all the clusters 
along the line of sight. In practice, we produce individ- 
ual SZ sky maps of size 5x5 degrees. We set the image 
resolution (pixel size) to 0.25 arcminutes. In order to al- 
low robust comparison with the analytic results, we also 
create a second set of simulated maps in which we re- 
place the gas model cluster 'postage-stamp' images with 
a /9-model profile with P — 1 and normalized in the same 
way as for the analytic calculation described in the pre- 
vious Section. We henceforth refer to the maps produced 




Fig. 4. — SZ angular power spectrum measured for each of the 25 
deg^ simulated maps for our BO sample (grey lines). Black error 
bars plot the la statistical errors around the mean value Ci (black 
solid line) for the sample. The blue error-bars denote the standard 
deviation expected from Gaussian cosmic variance alone (the first 
term in Eqn. [Sjl. The red line denotes the power spectra calculated 
using the halo model and KS gas profile. 



with the iBode et all ()2007l ) gas prescription as the BO 
gas model sample. In total we produce 96 25 deg^ maps 
(2400 deg^ in total of simulated sky) for each gas profile. 

3.2. Simulated SZ power spectrum and trispectrum 

In Figure [?] we plot the SZ angular power spectrum 
measured for each of the 25 deg^ simulated maps (grey 
lines) in our BO gas model sample. The solid black line 
represents the mean power spectrum measured for the 
BO gas model maps; the red solid line the halo model 
predictions for the KS gas model. The black error bars 
denote the la statistical errors around the mean value Ci 
for the BO map sample. The blue error-bars denote the 
standard deviation expected from Gaussian cosmic vari- 
ance alone (ctg: the first term in Eqn. [3]). The simulated 
power spectra are band-averaged in bins of A; — 300 and 
are apodized with a 2d Gaussian window function with 
unity variance. The analytic power spectra have been 
convolved with a Gaussian of FWHM 0.25 arcminutes 
in order to approximately account for the pixelisation 
of the simulated maps. We also note that the analytic 
power spectra are calculated omitting the contribution 
of clusters with Mvir < 2.5 x 1O^'^M0. This is because 
the simulated maps are only 100% complete down to this 
mass, and so we have omitted lower mass clusters from 
the simulated sky-maps. 

We note that the power spectra measured for the BO 
gas model simulations agree with the KS power spectra 
at very small angular scales {£ > 10, 000) but have signifi- 
cantly more power at larger scales. This additional power 
is most likely due to the impact of the feedback param- 
eter in the BO model. The additional energy added to 
the gas has the effect of heating and 'pufhng out' the gas, 
reducing the slope of the pressure profiles in the outer re- 
gions and increasing the large-scale signal. We also note 
that the KS model has a relatively steep SZ profile at 
larger radii, reducing the signal at larger scales. The 
full impact of non-gravitational heating sources on the 
ICM gas temperature and density profiles is currently 
poorly understood; the KS presciption is not astrophysi- 
cally well-motivated and there is no reason to expect the 
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Fig. 5. — Square root of the ratio of the non-Gaussian (trispec- 
trum) to Gaussian contributions to the total variance in C^, 
cf^q/cfq, as a fuction of angular multipole number. Heavy solid 
and dashed lines represent the /3-model and BO gas model sim- 
ulated maps, respectively. Light solid and dashed lines show the 
results from the halo-model calculation assuming /3-model and KS 
gas profiles. 

model to match either the BO gas model or real clusters 
at high redshift. Hence there remains a large theoreti- 
cal uncertainty in Cg which must be accounted for when 
attempting to constrain erg via the SZ power spectrum. 
The exact range of uncertainty in Cg between different 
models and simulations has not been quantified and we 
leave this issue to future work. We have confirmed that 
the power spectra measured from our sample of /3-niodel 
maps matches the equivalent predictions from the halo 
model approach. 

As demonstrated by the error bars in Figure [H the 
total field-to-field variation in the power spectrum am- 
plitude is several times greater than the Gaussian error, 
with the factor decreasing with increasing I. This is due 
to Poisson noise associated with the discrete sampling 
of the halo mass function in each field. Comparing the 
individual simulated power-spectra to the error-bars, it 
is also clear that the distribution of Ci is highly non- 
gaussian, especially for i < 6000. The power spectra 
for individual maps (grey lines) indicate that the non- 
gaussian_variance is being driven by fields with Ce » Cg 
(where Cg represents the mean), corresponding to fields 
that contain at least one very massive cluster. 

In Figure [5] we plot the ratio of the non-Gaussian and 
Gaussian contributions to the total standard deviation 
in the statistical distribution of Cg, (Tnc/cG (the second 
term divided by first term in Eqn. [31), as a function of 
angular multipole number. The total variance of P{Cg) 
is therefore just the sum of a^Q and CTq, although we 
reiterate P{Cg) is a strongly non-gaussian function at 
larger scales and thus the total variance (and mean) does 
not completely define this distribution, ct^q is measured 
from the simulated maps by calculating the variance in 
Cg for each set of maps, multiplying this by /sky and 
deducting the Gaussian variance, calculated using Cg. 
The heavy lines represent the results for our simulated 
/3-model (solid) and BO gas model (dashed) maps. The 
thin lines are the equivalent results for the /3-model and 
KS halo model calculations, respectively. 

Comparing thick and thin solid black lines, we find 
that the non-Gaussian error measured for our set of sim- 
ulated /3-model maps is approximately one third greater 




£ 

Fig. 6. — (Upper) The mean angular power spectrum measured 
for our WMAP 3 (solid) and 5 (dashed) samples of simulated maps 
(BO gas model), having applied no masking (black), and masking 
the brightest clusters down to 0.01, 0.1 and 0.9 clusters deg~^ 
(blue, green and red lines). (Lower) The ratio of the sensitivity 
of the SZ power spectrum to the difference in ug between our two 
simulations having masked clusters to that with no masking. 

than that predicted by the trispectrum. As we find a 
good agreement between the simulated and predicted 
power spectra, this discrepency must be due to effects 
that increase the width of P{Cg) without influencing 
strongly the ensemble-averaged mean. One such effect is 
the non-linear bias, or small scale halo clustering, which 
is not accoun ted fo r in the trispectrum . As n oted in 
Section [2] and ICoorav (2001); Komatsu fc Seljakl ([2002); 
IZhang fc ShethI ()2007f ). the two halo (and higher-order) 
contribution to the trispectrum - which accounts for the 
linear bias - is insignificant, including the two- halo term 
increases CTnc by only a few percent. We have found that 
by remeasuring a^o for both sets of simulated maps hav- 
ing first randomised the halo positions reduces crnc/cG 
such that the simulated /3-model line becomes into better 
agreement with that of the trispectrum result, although 
remains above the predicted curve at all £. 

The thick and thin dashed lines in Figure[5]show the ra- 
tio of the non-Gaussian to Gaussian variance for BO gas 
model simulations and the analytic KS model. The sim- 
ulations demonstrate a shallower dependence on £ than 
the halo model calculation. Given that ctq oc Cg, the 
flatness of the simulated curve is likely due to the larger 
mean power simulated spectrum in the simulated maps 
compared to the KS model predictions. We note again 
however that there is no reason why an agreement is ex- 
pected. Indeed, as the simulation results include large 
variations in the underlying dark matter halo structure 
(including morphology and substructure) we would ex- 
pect (T„g/o'g to be greater, especially at small angular 
scales. 

3.3. Masking Clusters 

In Section [5] we showed that the primary contribu- 
tion to the SZ power spectrum comes from clusters oc- 
cupying a different region of the cluster mass-redshift 
plane to those that determine the trispectrum. Clus- 
ters at 2; « 0.4 and of mass 4 x IO^^/i^^Mq contribute 
most strongly to the power spectrum, whereas clusters 
at 1 — 2 X 10^^/i~^Mq are the foremost contributors to 
the trispectrum. Therefore, if one wishes to measure ac- 
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curately the mean SZ power spectrum, very high mass 
clusters constitute a source of noise. One can mask or re- 
move the brightest clusters detected in a SZ map down to 
some threshold density in order to suppress the trispec- 
trum contribution to the total variance in Ci without re- 
moving a large component of the signal from the power 
spectrum. We now demonstrate this in practice using 
our sample of simulated maps. 

To evalulate the sensitivity of the SZ power spectrum 
to (T8, we use a second lightcone simulation to create 
an additional sample of mock SZ maps. The cosmolog- 
ical parameters were chosen to be consis tent with those 
measured from the 5th- year WMAP data (jPunklev et al.l 
[2009). therefore erg — 0.8 for this simulation (compared 
to 0.77 as for our WMAP3 run). The simulated volume 
is a periodic cube of size L — lOOO/i^^Mpc; the parti- 
cle mass nip = 6.82 x 10^°/i~^Mo and the cubic spline 
softening length e = 16.28/i~^kpc. We have applied the 
iBode et al.l (|2007f) gas prescription to the halos in the 
WMAP5 lightcone, and created a second set of mock SZ 
sky maps using the procedure described in the previous 
section. We refer to the as = 0.77 maps as the WMAP3 
simulation, and the o-g = 0.80 maps as the WMAP5 sim- 
ulation. 

Clusters are masked from the simulated maps in the 
following manner. For each of the 96 maps in the 
WMAP3 & 5 samples, there is an associated catalogue 
listing all clusters in map. We combine all 96 catalogues 
and rank-order clusters by their integrated SZ flux Y . 
Starting with the brightest cluster, we then fit and re- 
move an isothermal /3-model (with j3 fixed at 1) from 
the map at the location of the cluster. This is then 
repeated until all clusters down to the desired surface 
density threshold rith have been masked. This process 
can be performed in practice on real SZ survey maps by 
ranking detected clusters by their detection significance, 
and removing each in turn down to the required thresh- 
old surface density (and is somewhat analogous to the 
application of an optirnal matched-filt er, Herran z et al.l 
l2002HMelin et al.ll2006HSchafer et al.ll2006l) . 

The upper panel of Figure [5] demonstrates the mean 
power spectrum measured with no clusters masked (black 
line), and having masked clusters down to 0.01, 0.1 and 
0.9 deg"^ (blue, green and red lines). The solid lines 
show the results for our WMAP3 simulation, the dashed 
lines for our WMAP5 lightcone simulation. For minimal 
masking, power is removed only on large angular scales, 
shifting to smaller scales as increasing numbers of clus- 
ters are masked. Removing 0.01 clusters per square de- 
gree barely changes the average power spectrum, whilst 
masking 0.1 clusters per square degree reduces power by 
less than a third a,t £ — 1000, and negligibly aX t> 3000. 
Masking 0.9 clusters deg~^ removes power over a wide 
range of scales, reducing the mean band-averaged power 
by a factor of 2.5 at ^ = 1000 and by 27% at ^ = 5000. 
As expected, the most massive clusters contribute most 
of their power at large angular scales, and that at scales 
of £ > 5000 low mass clusters contribute the majority of 
the signal. These results are similar to those shown in 
Figured 

In the lower panel of Figure [6] we plot the sensitivity of 
the SZ power spectrum to the difference in erg between 
the two simulation samples (Acrg = 0.03) including clus- 



ter masking, divided by the sensitivity for the unmasked 
case. We define sensitivity as being the difference in Ct 
between the two simulations divided by the standard de- 
viation in Cf, cr{Ci) measured in the ug = 0.77 sample 
of maps. The line colours are the same as in the upper 
panel. It is immediately evident that whilst removing 
down to 0.01 clusters per square degree has little impact 
on the mean power spectrum, there is a clear gain in sen- 
sitivity below £ = 1000. Masking just the 10 brightest 
clusters per 100 deg^ improves the sensitivity by a fac- 
tor of two at ^ = 1000 and 1.5 at £ = 2000. Masking 
the 90 brightest clusters in the same area enables similar 
improvements at £ = 2000 and 4000. The improvement 
in sensitivity measured for our simulated maps agrees 
well with the analytic predictions from the halo model 
approach (see the lowest panel in Figure [2]). 

In addition, as one masks out the brightest clusters the 
PDF of Ci becomes significantly more Gaussian. This 
is demonstrated in Figure [7l in which we plot the dis- 
tribution of Ci/Ce at 1 = 1000, 2500, 6000 and 10,000 
for our WMAP3 sample._ The unshaded, dashed his- 
togram represents P(Cf/Cf) for the unmasked case, the 
shaded histogram represents P(C£/C^) having masked 
bright clusters down to 0.9 clusters per square degree. 
For the unmasked case, it is clear that the level of non- 
Gaussianity is significantly greater at large scales, where 
massive clusters contribute most of their power. It is 
also immediately apparent that the masked distribution 
is significantly more Gaussian than the unmasked case. 
This can be quantified by the reduction in skewness be- 
tween the masked and masked distributions, which is 1.0, 
0.5, 0.2 and 0.1 at £ = 1000, 2500, 6000 and 10,000. 

4. IMPACT OF FOREGROUNDS AND NOISE 

We have demonstrated the utility of masking clusters 
from SZ sky maps to reduce the non-Gaussian variance 
in the amplitude of the SZ power spectrum and thus 
improve the precision with which one can measure the 
ensemble-averaged Ci. However, there still remains a 
question of the overall gain in sensitivity once one ac- 
counts for astrophysical foregrounds, such as the pri- 
mary CMB temperature anisotropics and the contribu- 
tion from radio and sub-mm point source populations, 
and instrument noise. While masking bright clusters re- 
duces the field-to- field variance in C^, the amplitude of 
the ensemble- averaged Ce is also reduced. Masking in- 
creasing numbers of clusters thus reduces the amount of 
SZ power relative to that of the foregrounds. The vari- 
ance in the foreground signal is proportional to C^fg, 
therefore as the SZ signal is reduced by masking, the to- 
tal fractional error on the SZ amplitude due to the error 
in the foreground signals increases. The relevant ques- 
tion is, how does the total intrinsic error (Gaussian plus 
non-Gaussian) on the SZ signal cr(C^.sz) compare to the 
error (or uncertainty) on the the primary CMB and point 
source power spectra amplitudes within a given field? 

If <TnG dominates the error in the total power measured 
on the sky, C^^sky, where 

,sky = Ce^sz + Ce^CMB + Cg^ps + Qjnst , (5) 

then one can simply remove the foreground components 
from the measured power spectrum, and clipping the 
bright clusters will improve the precision on the mea- 
sured Ci^sz- On the other hand, if the error due to cosmic 
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Fig. 7. — The distribution of CijCg measured for each simulated BO gas model field in our WMAP3 sample, where is the mean over the 
whole sample of 96 maps, at four different angular multipole numbers: I = 1000, 2500, 6000 and 10, 000. The unshaded, dashed histogram 
shows the results without masking clusters. The shaded histogram represents results having masked the brightest 0.9 clusters/deg^ from 
the maps. 



variance on C^^cmb or C^^ps is comparable to <t(C^,sz) 
then reducing the amphtude of the SZ signal by masking 
clusters may degrade the measured SZ power spectrum 
amplitude. 

For mm- wavelength experiments, the variance on the 
total power spectrum due to all sources can be written as 
the sum in quadrature of the Gaussian sample variance 
term (including CMB, point sources, Gaussian SZ error 
and instrument noise), the non-Gaussian SZ error and a 
theory error, 

O" (Q.sky ) = (CTcmb + fsz.G + ^ps + 0"inst) + ^sz nG + ^th ' 

(6) 

where we have included an error on the predicted SZ 
power spectrum, tr^jj, due to the uncertainties in model- 
ing cluster gas physics. Using Eqn. [3] this can be rewrit- 
ten as 

wt5^ = ['^"^''"'^ 

where s(^,nth) = CTnc/fG and M(£,nth) is the ratio of 
the masked to unmasked C^^sz's for a given mask surface 
density threshold nth, and t is the fractional error on 
Ci.sz due to the theoretical uncertainties in cluster gas 
physics. F is defined as the ratio of foreground to SZ 
power, 

Ce.cMB + Ce,ps + B^'^Ct^mst ,r,\ 
^ i«) - — '■ 7=; : l^j 

(-^£,sz 



where B is the Fourier domain profile of the telescope 
beam. We henceforth assume a 10% theoretical uncer- 
tainty on Ci, and lOfiK instrument noise in an arcminute 
pixel for a single-frequency survey at 90Ghz of 25 deg^ 
of sky with a Gaussian beam of FWHM 1.67 arcmin- 
utes. The values of s are measured from our WMAP3 
simulation SZ map sample, with A£ — 300. The point 
source power spectrum is calculated assuming radio and 
sub-mm source populations arid spec tral index distribu- 
tions a s given in lde Zotti et al.l (|2005( ) and lNegrello et al] 
(|200l . 

The upper panel of Figure [5] shows the total fractional 
error on Ce^sz as a function of angular multipole number. 
The solid black line gives the total fractional error (as 
given in Eqn. [7]) for no masking {Al — 1). The black dot- 
dashed, dashed and dotted lines show the contribution 
of the SZ trispectrum, the total error on the combined 
foreground signals (including instrument noise), and the 
theory uncertainty on Ci, respectively. The red lines 
show the contributions of the CMB plus Gaussian SZ 
variance (dashed) and that of the point sources and in- 
strument noise (dotted). It is evident that in the range 
2300 < £ < 5000 the non-Gaussian variance in the SZ 
power spectrum amplitude is the dominant source of er- 
ror on Ci^sz, with cosmic variance in the CMB power 
dominating at larger scales, and that of the point sources 
and instrument noise dominating at smaller scales. It is 
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clear that in this range, C^^sz can be more accurately 
measured if one can reduce the non-Gaussian variance 
CnG by masking clusters. 

The lower panel of Figure [S] demonstrates the reduc- 
tion in the fractional error on C^^sz having masked bright 
clusters down to the surface density threshold nth (given 
in the legend), compared to the unmasked case. For 
i < 2000, the Gaussian variance on the CMB amplitude 
dominates and so masking down to 0.1 clusters deg~^ 
(dashed line) merely reduces the amplitude of the SZ 
signal with no associated reduction in the error {a{Ci)). 
Thus the total fractional error increases relative to the 
unmasked case. However, masking clusters down to 0.25 
per square degree (dotted) gives a 20% reduction in the 
fractional error on Ce^sz compared to the unmasked case 
in the range 3000 < £ < 4000. We have found that 
0.9 clusters deg~^ (dash-dotted) gives the best results; 
more aggressive masking reduces the amplitude of the SZ 
power to the point at which the error on the foregrounds 
becomes significant. Above £ — 6000, the variance due 
to point sources and instrument noise dominates and so 
masking clusters simply diminishes the amplitude of the 
SZ signal without an associated reduction in the variance. 
As at ^ < 2000, this results in a decreased sensitivity to 
Ci^sz at these scales. The red dotted line shows the re- 
duction in the fractional error (for nth = 0.9) if one can 
perfectly remove the primary CMB contribution using 
multi- frequency observations. 

Overall, our results show that, after accounting for the 
various foregrounds, for a simulated single frequency sur- 
vey with lOfiK instrumental noise a 20% improvement 
in the measurement of the amplitude of the SZ angular 
power spectrum can be obtained if one masks out the 
brightest clusters from the map down to a surface den- 
sity threshold of at least 0.25 clusters degree"^. This 
corresponds to a reduction of « 3% on the error on trgj 
although we note the additional gain in the Gaussianized 
statistics of P(C^,sz)- 

5. CONCLUSIONS 

We have demonstrated that the SZ power spectrum 
has a significant amount of variance contributed by rare 
massive clusters. While these objects do not contribute 
heavily to the ensemble-average SZ power spectrum, they 
do add significant non-Gaussian noise to measurements 
of the power spectrum amplitude and thus degrade cos- 
mological utility. Furthermore, it is difficult to use nu- 
merical simulations to characterize the statistics of the 
SZ power spectrum because one must have sufficiently 
large volumes to be able to adequately sample the rarest 
objects in the Universe. We have also shown that the 
magnitude and scale dependence of the non-Gaussian 
variance is sensitive to the modelling of the intra-cluster 
gas. 

A simple solution is to mask out the largest objects 
in the map before estimating the power spectrum. We 
have demonstrated using both halo model calculations 
and a large sample of simulated SZ sky maps that fairly 
conservative masking can reduce the variance and Gaus- 
sianize the statistics significantly, increasing the precision 
of the measured Ce and thus the sensitivity to cosmolog- 
ical parameters. For example, masking the 10 brightest 
clusters in a 100 deg^ SZ map will improve the preci- 
sion on C£ by a factor of two at £ — 1000 and 1.5 at 
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Fig. 8. — (Upper) Fractional error on gz as a function of 
angular multipole number. The solid black line gives the total 
fractional error as given in Eqn. [7] The black dot-dashed, dashed 
and dotted lines show the contributions of the SZ trispectrum, 
the combined foreground signals (plus Gaussian SZ variance), and 
the theory uncertainty on C'l, respectively. The red dashed line 
shows the contribution of just the primary CMB and Gaussian 
SZ variance; the red dotted line shows that of the point sources 
plus instrument noise. (Lower) The total fractional error on Ci^^z 
having masked bright clusters from the map divided by the error 
on having applied no masking. In the range 3000 < £ < 6000 

(where the SZ trispectrum is the dominant error) removing bright 
clusters from the map increases the precision of measurements of 
the SZ power spectrum. 

£ — 2000. Masking the 90 brightest clusters in the same 
region should improve the accuracy by the same factors 
at £ = 2000 and 4000. This is in agreement with work 
related to information content in the matter power spec- 
trum in the trans-linear regime, where it has been found 
that a few massive objects arc responsib le for much of the 
varian ce (jNevrinck fc Szapudil 120081) . iTakada fc Bridld 
(|2007f ) have also demonstrated that knowledge of the 
number of massive clusters within a given survey field 
can help improve the precision of measurements of the 
weak lensing shear power spectrum. 

Choosing which objects to mask is non-trivial; we 
found that using a fixed sky density produced a well- 
defined and well-behaved estimate that can easily be ap- 
plied to real maps. This skirts the issue (at least par- 
tially) of finding an observable that is well-correlated 
with mass, and can be easily applied to any survey (for 
example, weak lensing). 

We have also shown that the non-Gaussian variance in 
the SZ power spectrum remains a significant source of 
error in the range 2300 < £ < 5000, even in the pres- 
ence of the astrophysical foregrounds provided by the 
primary CMB temperature anisotropics and unresolved 
point source populations. We have demonstrated that 
even single frequency experiments with no ability to spec- 
trally subtract the CMB and point sources, can improve 
the accuracy on Ci^sz by more than 20% in the range 
3000 < £ < 4000 by masking the brightest clusters down 
to between 0.25 and 0.9 clusters per square degree. 

We emphasize that one is not losing cosmological in- 
formation by masking bright clusters since our proce- 
dure is a way to discard the signal from massive clus- 
ters whose information we do not know how to analyze 
but which still contributes to the variance. Halo model 
calculations of the SZ power spectrum demonstrate that 
the ensemble-averaged power spectrum (C^,sz) - to which 
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high mass clusters do not con tribute siRnificaiitly - ha s 
a sensitive dependence on as (IKomatsu fc Seliaki[200l) . 
If one had prior knowledge of the number density and 
gas properties of clusters over a wide mass and red- 
shift range it would be possible to construct an optimal 
power spectrum estimator in which the signal from high 
mass clusters would be down- weighted. However, in this 
case one would already know the cluster mass function ~ 
which is more sensitive to cosmological parameters - and 
thus power spectrum measurements would reveal no ad- 
ditional information. In the absence of complete knowl- 
edge of the mass function, one can simply apply zero 
weight to the brightest clusters as they effectively con- 
tribute only noise to the power spectrum. On the other 
hand, masked clusters carry much statistical weight when 
used to measure the cluster mass function. 

There are still theory challenges to predicting the de- 
tailed shape and amplitude of the SZ power spectrum, 
but we have demonstrated that the field-to-field variance 
can be reduced to allow precise tests of the physics and 



astrophysics involved. 
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